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(54) Bitstream quality analyzer 

(57) A bitstream quality analysis system includes a 
demultiplexer (180), a bitstream quality analyzer (190), 
and a graphical user interface (200). The demultiplexer 
receives a bitstream and separates from the bitstream 
at least one elementary bitstream that includes a video 
elementary bitstream. The bitstream quality analyzer 
receives the video elementary bitstream from the 
demultiplexer and parses the video elementary bit- 
stream to extract parameters characterizing the video 
elementary bitstream. The bitstream quality analyzer 
provides the extracted parameters to the graphical user 
interface, which displays the extracted parameters char- 



acterizing the video elementary bitstream. A user can 
monitor the video elementary bitstream at varying levels 
of detail, and can perform varying levels of quality anal- 
ysis on the video elementary bitstream. Dependent on 
the level of monitoring an quality analysis selected by 
the user, the bitstream quality analyzer provides infor- 
mation relating to buffer usage and coding efficiency 
and can predict the quality of the fully decoded video 
sequence without fully decoding the video elementary 
bitstream and without using the original video 
sequence. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention is directed to methods and 
apparatus for monitoring and analyzing a video bit- 
stream. More particularly the present invention is 
directed to methods and apparatus for monitoring an 
MPEG video bitstream and providing metrics and anal- 
ysis of the MPEG video bitstream at varying levels of 
detail based on the level of monitoring and analysts 
selected by a user. Levels of monitoring and analysis 
include monitoring the various characteristics and 
attributes of the MPEG video bitstream, testing for 
MPEG compliance and real-time decidability, and pre- 
dicting the visual quality of decompressed video 
sequences contained in the MPEG video bitstream. 

2. Discussion of the Related Art 

The MPEG (Moving Picture Experts Group) stand- 
arete are some of the most widely accepted digital video 
encoding standards and include MPEG-1 and MPEG-2 
(MPEG-4 is still in development). MPEG-2 supports 
higher video quality at higher bit rates than MPEG-1, 
allows more options in encoding and decoding formats 
than MPEG-1, and is capable of handling existing 
MPEG-1 encoded bitstreams. MPEG-2 can also handle 
interlaced video and supports the scalable syntax, 
which allows an encoder to create enhancement layers 
that can both increase the quality and the spatial and 
temporal resolution of the motion picture. Such scalabil- 
ity is very useful in video applications involving telecom- 
munications, video service with multiple qualities, video 
on ATM networks, and HDTV. 

Fig. 1 illustrates a conventional MPEG (the term 
MPEG is used to denote MPEG-1 or MPEG-2, unless 
specifically noted otherwise) decoder 5. The decoder 5 
includes a system decoder 10 that parses the MPEG 
bitstream and feeds separated audio and video bit- 
streams (containing compressed audio and video data), 
along with timing information, to an audio decoder 15 
and a video decoder 20. The audio and video decoders 
1 5, 20 then decode their respective audio and video bit- 
streams to provide output data. 

As shown in Fig. 2, an MPEG video bitstream 
includes six layers: a video sequence layer, a group of 
pictures layer, a picture layer, a slice layer, a macrobiock 
layer, and a block layer. In Fig. 2, those portions of the 
video sequence that are denoted parenthetically are 
optional. The upper level layers of an MPEG video bit- 
stream differ, dependent upon whether the video 
sequence is an MPEG-1 video sequence or an MPEG- 
2 video sequence. Both MPEG-1 and MPEG-2 video 
sequences start with a sequence header and end in a 
sequence end code. The MPEG-1 sequence header is 



followed by at least one or more groups of pictures, 
each group of pictures starting with a group of pictures 
header and including at least one picture, with each pic- 
ture starting with a picture header and including one or 

5 more slices. In contrast, the MPEG-2 sequence header 
is followed by sequence extension information, and 
therefore, an MPEG-2 video sequence is easily differen- 
tiated from an MPEG-1 video sequence. The sequence 
extension is followed by a GOP header (optional), one 

70 or more pictures and the sequence end code. As 
shown, the GOP header may be followed by user data 
(optional). Each picture includes a picture header, fol- 
lowed by picture coding extension information, exten- 
sion and user data (optional) and at least one slice. 

is Regardless of whether the sequence is an MPEG-1 
video sequence or an MPEG-2 video sequence, each 
slice is made up of one or more macroblocks. A macrob- 
iock is made up of several 8 by 8 Discrete Cosine Trans- 
form (DCT) coefficient blocks along with a number of 

20 other parameters, such as quantization scale code, 
motion vectors, etc. (See ISO/IEC Standard 13818-2). 
The number of blocks forming a macrobiock can vary, 
dependent on whether the video sequence is MPEG-1 
or MPEG-2. For MPEG-2, the number of blocks can 

25 also vary dependent on the chroma format. Each block 
contains quantized DCT coefficient values that are vari- 
able-length encoded. Further details concerning MPEG 
are widely available, including MPEG VIDEO COM- 
PRESSION STANDARD, Joan L Mitchell et al. ( Chap- 

30 man & Hill, 1997, ISBN 0-412-08771-5, which is 
incorporated herein by reference. 

A simplified functional diagram of a conventional 
video decoder 50 is illustrated in Fig. 3. Conventional 
video decoders are generally implemented as a black 

35 box' in which the video decoder 50 receives a video bit- 
stream and provides output video data. Within the black 
box, the decoder 50 decodes the video bitstream on a 
macroblock-by-macroblock basis by performing the fol- 
lowing operations. First, the video decoder performs 

40 Inverse Variable Length Coding (IVLC) to expand the 
variable-length encoded data. Then, the video decoder 
dequantizes the quantized DCT coefficients (termed 
Inverse Quantization of DCT coefficients (IQDCT)) 
based on the appropriate quantization tables (matrices) 

45 and the quantization scale to reconstruct (as near as 
possible) DCT coefficients. Next, the video decoder per- 
forms Inverse DCT (IDCT) calculations and motion 
compensation to reconstruct the pixel data, which is 
provided as output video data at the appropriate time. 

so Because the MPEG standards have been so widely 
accepted and their use so varied, and because many 
different vendors make or sell MPEG products, there is 
a significant need to both monitor and analyze the qual- 
ity of an MPEG bitstream to verify at least minimal con- 

S5 formance with the standards. Due to the greater 
bandwidth demands of the video bitstream versus the 
audio bitstream, this need is particularly acute with 
respect to the MPEG video bitstream. For exanple. 
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designers of MPEG video encoders want to know that 
their output is MPEG compliant, while designers of 
MPEG video decoders want to know whether the data 
they are using to develop and test their design is, in fact, 
compliant with the standard. Furthermore, although the 5 
MPEG video syntax is specified in the standards, subtle 
differences in interpretation of the standards can exist 
while still conforming to the standard, and therefore 
interoperability problems are almost inevitable. Accord- 
ingly, designers of both encoders and decoders need a 10 
tool to analyze an MPEG video bitstream and isolate 
irrter-operability problems. 

Beyond this level of monitoring and quality analysis, 
there is also a need to provide information and quality 
analysis relating to those parameters over which is 
designers of MPEG encoders have some choice. These 
parameters may or may not be optimal, dependent on 
the particular encoding process used and the underly- 
ing video material that was encoded. Such information 
and quality analysis can provide feedback to help 20 
designers develop more efficient encoding algorithms 
that optimize picture quality for a given number of bits. 

There is also a need to monitor and assess the 
quality of the resulting video output data as it is seen 
after it has been received and decoded by an MPEG 25 
video decoder. This information is useful to designers of 
MPEG encoders as well as to broadcasters of MPEG 
transmissions. For example, a broadcasting station 
could use the information to save bandwidth if the qual- 
ity of the video output data was higher than necessary, 30 
and to increase the bandwidth of the transmitted signal 
if the quality of video output data was poor. Although 
conventional systems are available to assess the quality 
of video output data after encoding and decoding proc- 
esses, these systems rely on a comparison of the 35 
decoded output video data to the original input video 
data. This sort of comparison is computationally expen- 
sive inasmuch as it requires the full decoding of the 
received video bitstream and a bit-by-bit comparison 
with the input data. The time required for such a bit-by- 40 
bit analysis can be prohibitive. In addition, from a 
decoding point of view, the original input video data may 
simply not be available to the recipient of the MPEG bit- 
stream. 

45 

Summary Of The InyentiQn 

According to an illustrative embodiment of the 
invention, a bitstream quality analysis system includes a 
demultiplexer, a bitstream quality analyzer, and a graph- so 
ical user interface. The demultiplexer receives a bit- 
stream and separates the bitstream into at least one 
elementary bitstream that includes a video elementary 
bitstream. The bitstream quality analyzer receives the 
video elementary bitstream from the demultiplexer and 55 
parses the video elementary bitstream to extract param- 
eters characterizing the video elementary bitstream. 
The bitstream quality analyzer provides the extracted 



parameters to the graphical user interface, which dis- 
plays the extracted parameters characterizing the video 
elementary bitstream. 

According to an aspect of the invention, the bit- 
stream quality analyze- compares the extracted param- 
eters to parameter values allowed by the MPEG 
(MPEG-1 and MPEG-2) standard and notifies the user, 
via the graphical user interface when the video elemen- 
tary bitstream does not conform to the standard. Even 
when the video elementary bitstream does not conform 
to the MPEG standard, the bitstream quality analyzer 
may extract parameters and provide them to the graph- 
ical user interface for display, depending on the amount 
of non-conformance. 

According to another aspect of the invention, the 
bitstream quality analyzer includes an MPEG video 
decoder that receives the video elementary bitstream 
and selectively decodes the video elementary bitstream 
according to the level of monitoring and quality analysis 
selected by the user via the graphical user interface. 
Thus, if the user desires to view information pertaining 
to the various headers (sequence header, GOP header, 
etc), decoding is not performed at all and the bitstream 
quality analyzer simply parses information from the var- 
ious headers to provide to the graphical user interface 
for display. If the user desires to view quantized DCT 
coefficient values, then decoding is performed through 
the IVLC level, and if the user desires to view the DCT 
values themselves, then decoding is performed through 
the IQDCT level, rf the user desires to view the actual 
data, then full decoding is performed. Because the 
decoder selectively decodes the video elementary bit- 
stream to the level required by the user, considerable 
time and computational resources are saved. 

According to a further aspect of the invention, the 
bitstream quality analyzer analyzes information pro- 
vided by the video decoder and calculates various met- 
rics allowing the user to determine the quality of the 
encoding process. Based on decoded parameters 
obtained from decoding the elementary video bitstream 
to the IVLC level, the bitstream quality analyzer calcu- 
lates metrics pertaining to the amount of compressed 
data in the video decoder, how well the encoder esti- 
mated the size of the decoder buffer, how well the 
encoder estimated the bit rate, and whether coded bits 
are being used efficiently. 

According to another aspect of the invention, the 
bitstream quality analyzer further includes a com- 
pressed domain quality meter that predicts the quality of 
the fully decoded video sequence contained within the 
video elementary bitstream without fully decoding the 
video elementary bitstream and without using the origi- 
nal video sequence. The methods used to predict the 
quality of the fully decoded video sequence may be 
used on any video sequence that has been compressed 
according to a block DCT compression scheme, includ- 
ing MPEG-1 and MPEG-2, H.261. and H.263. 

Based on parameters obtained from decoding the 
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elementary video bitstream to the IVLC level, the com- 
pressed domain quality meter predicts the amount of 
spatial distortion in the fully decoded video sequence 
due to quantization of the DCT coefficients. Based on 
this same level of decoding, the compressed domain 
quality meter also predicts the amount of spatial distor- 
tion in the fully decoded video sequence that is capable 
of being perceived by a human observer. The com- 
pressed domain quality meter also predicts a third met- 
ric of the quality of the fully decoded video sequence 
that is indicative of the amount of motion prediction error 
in the encoding process. Although the third metric 
requires the inverse quantization of DCT coefficients, 
this step is performed only on the non -intra blocks, and 
does not require full decoding of the video elementary 
brtstream, nor does it require the original video 
sequence. 

Brief Description Of The Drawings 

Illustrative embodiments of the present invention 
are described by way of example with reference to the 
accompanying drawings, in which: 

Rg. 1 illustrates a conventional MPEG decoder; 
Fig. 2 illustrates various layers of an MPEG video 
sequence; 

Fig. 3 illustrates a simplified functional diagram of a 
conventional MPEG video decoder; 
Fig. 4 illustrates an example of a personal computer 
on which embodiments of the present invention can 
be implemented; 

Fig. 5 illustrates a high level functional block dia- 
gram of an MPEG video brtstream analysis system 
according to an embodiment of the present inven- 
. tion; 

Fig. 6 illustrates a high level functional block dia- 
gram of an MPEG video brtstream analysis system 
according to another embodiment of the present 
invention; 

Fig. 7 illustrates a functional block diagram of a bit- 
stream quality analyzer accorcSng to an embodi- 
ment of the present invention; 
Fig. 8 illustrates a functional block diagram of a 
compressed domain quality meter according to an 
embodiment of the present invention; 
Fig. 9 illustrates a top level view, as seen by a user, 
of a graphical user interface of a video bitstream 
analysis system according to an embodiment of the 
present invention; and 

Figs. 10-20 illustrate various views, as seen by a 
user, of a graphical user interface of a video bit- 
stream analysis system according to an embodi- 
ment of the present invention. 

Detailed Description 

According to a first aspect of the present invention, 



a bitstream quality analyzer is implemented in software 
on a personal computer (PC) turning the Windows NT™ 
operating system. An illustrative PC 100 in which the 
present invention can be implemented is shown in Fig. 
5 4. The PC 100 includes a processor 110, a memory 
120, a removable storage device 130, a display 150, a 
pointing device 160, and an input device 170, all cou- 
pled together via a bus 140. It should be understood 
that this PC 100 is shown merely for illustrative pur- 
io poses, and that the present invention is not limited to 
use with a system having this specific configuration, as 
other configurations are possible. For example, the 
memory 120 may consist of memory devices such as 
hard disk drives or optical disk drives, RAM, ROM, or 
75 other memory devices and combinations thereof. The 
input device 170 may include a keyboard that allows a 
user to interact with the system. The pointing device 
160 may, for example, be a puck, joystick, trackball, or 
mouse. The display 150 may be a CRT screen, or simi- 
20 lar device, that allows the user to monitor MPEG param- 
eters and visualize his/her interactions with the 
bitstream quality analyzer. The removable storage 
device 130 may be a CD-ROM drive, a tape drive, or a 
diskette drive. The removable storage device 130 can 
25 be used to load application software including the bit- 
stream quality analyzer software. 

Software, including code that implements the 
present invention, may be stored on some type of 
removable storage media such as a CD-ROM, tape, or 
30 diskette. The software can be copied to a permanent 
form of storage media on the PC (e.g., a hard disk) to 
preserve the removable storage media for back-up pur- 
poses. When the bitstream quality analyzer software of 
the invention is actually in use, the software is generally 
35 and at least partially stored in RAM, and is executed on 
the processor 110. 

Fig. 5 depicts a high level functional block diagram 
of an MPEG video bitstream analysis system according 
to an embodiment of the present invention. The MPEG 
40 video bitstream analysis system includes a line inter- 
face 175, a demultiplexer 180. a bitstream quality ana- 
lyzer 190, and a graphical user interface 200. The line 
interface 1 75 receives an MPEG bitstream and converts 
the MPEG bitstream into a format suitable for process- 
es ing by the demultiplexer 180. The demultiplexer 180 
separates the MPEG bitstream into various elementary 
bitstreams (audio, video, and/or multiple audio or multi- 
ple video elementary bitstreams) and provides the 
video elementary bitstream that is to be monitored to 
so the bitstream quality analyzer 1 90. The brtstream quality 
analyzer 190 parses and decodes the video elementary 
brtstream to extract various parameters and perform 
varying levels of quality analysis on that bitstream. Infor- 
mation concerning the monitored MPEG video elemen- 
ts tary brtstream. as well as varying levels of quality 
analysis, are displayed to a user via a Graphical User 
Interface (GUI) 200. In addition, through the GUI 200. 
the user is able to select the level of monitoring and 
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quality analysis to control the bitstream quality analyzer 
190. 

In current embodiments of the present invention, 
the line interface 175 can be any one of a synchronous 
parallel interface (SPI) card, a synchronous serial inter- s 
face (SSI) card, an asynchronous serial interface (ASI) 
card, or a G.703 interface card that is operatively con- 
nected to the PC's microprocessor bus. Because the 
actual implementation of the line interface 175 is 
dependent on the transmission medium on which the w 
MPEG bitstream is received, and because the imple- 
mentation of such a line interface 175 is known in the 
art, further description is omitted herein. Similarly, the 
actual demultiplexer 180 is implemented in software, 
and is based on the MPEG System Target Decoder is 
(STD) which is well documented in the MPEG standard. 
Therefore, further description of the demultiplexer 180 is 
omitted herein. 

The functional block diagram depicted in Fig. 5 sug- 
gests that the bitstream quality analyzer 190 is capable 20 
of keeping up with the video bitstream and operating in 
real time. Depending on the level of monitoring and the 
level of quality analysis desired, real time operation can 
be performed. Furthermore, various levels of monitoring 
and quality analysis can be performed in real time, 25 
depending on the resolution of the input video data, the 
input bit rate. etc. 

Fig. 6 depicts a high level functional block diagram 
of an MPEG video bitstream analysis system according 
to another embodiment of the present invention that is 30 
similar to the embodiment of Fig. 5, but which does not 
need to operate in real time. The MPEG video bitstream 
analysis system of Fig. 6 includes a line interface 175. a 
demultiplexer 180, a memory 120, a bitstream quality 
analyzer 1 90, and a Graphical User Interface (GUI) 200. 35 
As in Fig. 5, the line interface 175 converts the MPEG 
bitstream into a format suitable for processing by the 
demultiplexer 180. The demultiplexer 180 separates the 
MPEG bitstream into various elementary bitstreams 
(audio, video, and/or multiple audio or multiple video 40 
elementary bitstreams) and writes the video elementary 
bitstream that is to be monitored to the memory 120. 
Accordingly, the bitstream quality analyzer 190 can 
access the video elementary bitstream from memory, 
parse and decode the bitstream to extract various 45 
parameters, and perform varying levels of quality analy- 
sis of the video elementary bitstream to any level of 
detail desired by the user, as real time operation is not 
required. Various information concerning the monitored 
MPEG video elementary bitstream and various levels of so 
quality analysis are displayed to a user via the GUI 200. 
In addition, through the GUI 200, the user may select 
the level of monitoring and quality analysis to control the 
bitstream quality analyzer 190. For the same reasons 
mentioned with respect to Fig. 5. further description of ss 
the line interface 1 75 and the demultiplexer 1 80 is omit- 
ted. As can be appreciated, the memory 120 may be 
RAM. disk, or other type of wrrteabie storage. 



In broad functional overview, the bitstream quality 
analyzer 1 90 of Figs. 5 and 6 is a device that monitors a 
video transmission and analyzes the quality of that 
video transmission at varying levels of detail, selectable 
by a user via the GUI 200. As shown in Fig. 7, the bit- 
stream quality analyzer 190 includes an MPEG video 
decoder 210 and a Compressed Domain Quality Meter 
(CDQM) 220. The decoder 210 implements all of the 
functions of a conventional video decoder, including 
performing inverse variable length coding (IVLC) to 
expand the variable-length encoded data, performing 
inverse quantization of the quantized DCT coefficients 
(IQDCT) to obtain the DCT coefficients, and performing 
inverse DCT calculations (I DCT) and motion compensa- 
tion to reconstruct the pixel data. However, in contrast to 
conventional video decoders that receive a compressed 
video bitstream and output pixel data, the decoder 210 
of the bitstream quality analyzer 190 is capable of per- 
forming various levels of decoding, based on the level of 
monitoring and quality analysis selected by the user via 
the GUI 200. Within the bitstream quality analyzer 190. 
the CDQM 220 estimates various measures of distor- 
tion of the resulting video output (i.e. as pixel output) 
using only the compressed video bitstream. 

As functionally depicted in Fig. 7, decoding of the 
video bitstream is performed within the decoder 210 of 
the bitstream quality analyzer 190 in layers, dependent 
on the level of monitoring and quality analysis selected 
by the user via the GUI 200. Accordingly, if the user 
desires to view information pertaining to the various 
headers (sequence header, GOP header, etc), decod- 
ing is not performed at all and the bitstream quality ana- 
lyzer 190 simply parses information from the various 
headers to provide to the GUI 200. If the user desires to 
view quantized DCT coefficient values, then decoding is 
performed through the IVLC level, and if the user 
desires to view the DCT values themselves, then 
decoding is performed through the IQDCT level. If the 
user desires to view the actual data, then full decoding 
is performed. Because the bitstream quality analyzer 
190 can instruct the decoder 210 to perform varying 
degrees of partial decoding, considerable time and 
computational resources may be saved, depending on 
the level of monitoring and quality analysis selected by 
the user via the GUI 200. Thus, real time operation for 
certain levels of monitoring and quality analysis are pos- 
sible. 

The bitstream quality analyzer 190 provides metrics 
and analysis for three classes of quality. At class 1, 
MPEG Syntax and Semantics, the bitstream quality 
analyzer 190 analyzes the video bitstream for conform- 
ance with the applicable MPEG (MPEG-1 or MPEG-2) 
standard and identifies whether the video bitstream 
meets the specifications of the applicable MPEG stand- 
ard. At class 2. Dynamic Behavior, the bitstream quality 
analyzer 1 90 analyzes those parameters over which the 
designer of the encoder, or the broadcaster of the video 
bitstream had some choice, and displays information 
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allowing one to assess the wisdom of those choices. At 
class 3. Video Quality, the bitstream quality analyzer 
190 analyzes the quality of the resulting video output 
stream (i.e. pixel output) by providing estimates of vari- 
ous measures of distortion including mean square error, 
motion prediction error, and an objective metric that is 
based on human visual perception. Class 3 analysis is 
performed by the CDQM 220 within the bitstream qual- 
ity analyzer 190. 

At class 1 , the bitstream quality analyzer 190 deter- 
mines whether the video bitstream adheres to the appli- 
cable standard by analyzing header information, 
extension information, whether the appropriate end 
codes are present, etc., and comparing this information 
to the applicable standard. At class 1, the bitstream 
quality analyzer 190 examines both the syntax of the 
video bitstream and the semantics of the video bit- 
stream for compliance with the applicable standard. For 
example, syntactical information that is examined may 
include whether the required information is extracted 
from the expected position in the video bitstream (e.g., 
are start codes extracted and recognized when they are 
supposed to be extracted, or is there a start code at an 
unexpected position in the video bitstream) according to 
the syntax rules prescribed by MPEG. Semantic infor- 
mation that is examined may include whether actual val- 
ues of the various parameters are within minimum and 
maximum values established by the MPEG standard 
(e.g., the number of slices in a picture, whether values 
of dequantized DCT coefficients are within allowed val- 
ues, etc.) and whether the rules of the standard have 
been followed (e.g.. whether an I frame comes immedi- 
ately after a GOP header, etc). Information regarding 
the applicable MPEG standards is stored in memory 
120 of PC 100, 

If the video bitstream conforms to an MPEG stand- 
ard, then detailed information about the video bitstream 
is provided to the GUI 200 for display. For example, 
information that is provided to the GUI 200 for display 
may include whether the video bitstream is MPEG-1 or 
MPEG-2, actual parameter values within the various 
headers (sequence header, GOP header (if present), 
picture header, slice header, etc), extension information 
(sequence extension and picture coding extension (if 
MPEG-2)), as well as computationally based informa- 
tion such as the number of slices in a frame, etc. The 
level of detail of the information displayed to the user is 
selectable by the user via the GUI 200 and can require 
different levels of decoding, ranging from information 
that requires only parsing of the header information, to 
information that requires performance of inverse varia- 
ble length coding (IVLC) and inverse quantization of the 
DCT coefficients (IQDCT). 

The bitstream quality analyzer 190 is also able to 
monitor and analyze certain video bitstr earns that are 
not compliant with MPEG standards. For example, 
MPEG encoders that were developed before a standard 
was frozen, and which may not be fully compliant can, if 



sufficiently well known, be recognized and the various 
parameter values displayed via the GUI 200. The extent 
of non-compliance may be semantic or syntactic, or 
both. In the embodiment of Fig. 6, even if the non-com- 

s pliant video bitstream isn't recognized, there is no 
requirement for real-time decoding, and thus the bit- 
stream quality analyzer can monitor non-compliant bit- 
streams that could not be decoded in real-time due to a 
variety of reasons. When the non-compliant video bit- 

io stream can be recognized or decoded, a dialog box is 
presented to the user for each incorrect parameter, with 
an informational message explaining the nature of the 
non-compliance. Moreover, all such errors are written to 
a log file for future reference and diagnosis. If the video 

is bitstream is not compliant with MPEG standards, and it 
can not be recognized, then an informational message 
stating this fact is displayed by the GUI 200. In a current 
embodiment of the invention, a dialog box is presented 
to the user indicating that decoding of the video bit- 

so stream was not possible, and the most likely reason for 
non-compliance (e.g., an un-recognized start code at 
some point in the video bitstream). Of course, it should 
be realized that the bitstream quality analyzer 190 may 
be modified to monitor and analyze other types of brt- 

25 streams, like H.261 and H.263 bitstreams, each of 
which are structurally quite similar to MPEG and include 
video elementary bitstreams that are syntactically simi- 
lar to those of MPEG. 

At class 2, the bitstream quality analyzer 190 ana- 

30 lyzes those parameters over which the designer of the 
encoder, or the broadcaster of the video bitstream had 
some choice, and provides information to the GUI 200 
allowing one to assess the wisdom of those choices. For 
example, at class 2, the bitstream quality analyzer 190 

35 provides the minimum size of the compressed video 
data buffer that the encoder believed sufficient to fully 
decode the video bitstream data (based on 
vbv_buffer_size and vbv_buffer_size_extension), and 
provides, on a frame-by-frame basis, various indicia of 

40 buffer fullness. Based on the STD model, the type of 
frame (I, P, or B), and the amount of compressed video 
data in the various types of frames, the bitstream quality 
analyzer 190 calculates the amount of compressed 
video data in the compressed video data buffer and 

45 updates this calculation each frame. The bitstream 
quality analyzer 190 also calculates a second metric of 
buffer fullness based on the amount of compressed 
video data in the compressed video data buffer and the 
amount of time the encoder believed a particular frame 

so would be stored within the compressed video data 
buffer before that compressed data would be displayed 
and discarded (based on vbvjdelay), and updates this 
calculation each frame. Although these two metrics 
should ideally yield the same result, in practice they do 

55 not, and hence, the bitstream quality analyzer 190 also 
calculates a difference between these two metrics and 
updates this difference calculation each frame. The dif- 
ferent types of buffer information that is displayed via 



<EP 0669684A2_I_> 



6 



11 

the GUI 200 readily allows one to identify if there is a 
risk of buffer underflow or overflow, and to determine 
how well the encoder estimated the minimum size 
required for the compressed video data buffer and how 
well the encoder estimated the amount of time that 5 
frames would remain in the compressed video data 
buffer. Furthermore, each one of these metrics of buffer 
fullness allow one to assess the real-time decidability 
of the video elementary bitstream. 

In addition to providing various metrics relating to 10 
buffer status, the bitstream quality analyzer 190 pro- 
vides information identifying the type of frame (l ( P, or 
B), the amount of compressed video data in each type 
of frame, and the amount of bit stuffing (used by the 
encoder to achieve the target frame rate) in each type of 15 
frame. This information is provided to the QUI 200 on a 
frame-by-frame basis for each of the most recent 
frames, and allows the user to compare the amount of 
useful compressed video data in a particular frame to 
the amount of bit stuffing in that particular frame. Based 20 
on this information, the user can determine whether 
encoded bits are being used efficiently. This information 
also allows the user to get a rough indication of the 
expected resolution of the particular frame, as the reso- 
lution of the frame may be expected to be better when 25 
the amount of useful compressed data is high in relation 
to the amount of bit stuffing. The bitstream quality ana- 
lyzer 190 also identifies the target bit rate set by the 
encoder and calculates a smoothed average value of 
the actual bit rate based on the amount of compressed 30 
video data received by the decoder 210 per unit time. 
This bit rate information identifies how well the encoder 
met the target bit rate. Accordingly, at class 2, the bit- 
stream quality analyzer provides various metrics allow- 
ing the user to readily assess the quality of the video 35 
bitstream. 

At class 3, estimates of the quality of the resulting 
video output stream (i.e. pixel output) are provided 
using various measures of distortion, including mean 
square error, motion prediction error, and an objective 40 
metric based on human visual perception. The different 
estimates of the quality of the resulting video output 
stream are performed by the COQM 220. These esti- 
mates require varying levels of decoding and are 
selectable by the user through the QUI 200. For exam- 45 
pie, as shown in Fig. 8, the estimation of the mean 
square error and the objective measure of distortion 
based on human visual perception require decoding of 
the video bitstream to only the IVLC level, while the esti- 
mation of the motion prediction error requires decoding so 
to the DCT coefficient level (i.e. performing IVLC and 
IQDCT). 

Regardless of which measure of quality is selected, 
none of the measures requires the full decoding of the 
video bitstream to the pixel level. Furthermore, none of ss 
the measures of quality requires the presence of the 
original input bitstream, prior to encoding, to judge qual- 
ity, as the original input bitstream may be unavailable at 
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the decoder. Accordingly, the analysis performed by the 
CDQM 220 may be performed either at a transmitting 
station or a receiving station, or both, to estimate the 
quality of the resulting output video bitstream in minimal 
time, with minimal processing, and without requiring the 
original input video bitstream prior to encoding. Thus, a 
broadcasting station may use the information to save 
bandwidth if the quality of the video output data is higher 
than necessary, and to increase the bandwidth of the 
transmitted signal if the quality of video output data is 
poor. 

The CDQM 220 provides three different estimations 
predicting the amount of distortion in the resulting video 
output bitstream due to the encoding process. The first 
estimation predicts the amount of spatial distortion of 
the resulting video output stream due to quantization 
error, and is based on a mean square error analysis. 
The quantization error for a single DCT coefficient (in 
the DCT domain) is given by Eq. (1): 

Q erwr =\DCT r IQDCT ,| (1) 

where DCT f is the original value of the Vth coefficient 
before quantization and IQDCTj is the value of the Vth 
coefficient after inverse quantization. However, this 
exact error analysis is of little use if the original coeffi- 
cient value before quantization (DCTj) is not available. 
Thus, the quantization error for a single DCT coefficient 
(DCTj ) of a coded block e of a macroblock (in the DCT 
domain) is instead approximated as shown in Eq. (2): 

O^or^l^i (2) 

where Aj is the quantization step (the number by which 
the DCT coefficient. DCTj, was quantized) for the Vth 
coefficient, and Aj = M*Q j( where M is the current 
quantization scale factor and Qj is the quantization 
value (from the quantization matrix) of the "i"th coeffi- 
cient. Thus, in the pixel domain, the Mean Square Error 
(MSE) of a single coded block e of a macroblock is 
given by Eq. (3): 

64 o 64 

e=/V£ IDC^MQjY^N^ ^IDCT(Q f ) 2 (3) 
i-i / = 1 

where N represents the number of blocks actually 
coded for that macroblock. The MSE is computed sepa- 
rately for luminance (Y) and chrominance (C p C&) com- 
ponents. The value of N may change when one 
computes the MSE for the luminance signal versus the 
chrominance signal, and may also change depending 
on whether all the blocks of a given macroblock are 
coded. Thus, N represents the number of blocks that 
are actually coded in a given macroblock. The quantity 
IIDCT(Qj) 2 from i= 1 to 64 can be computed once for 
each of the quantization matrices (intra and non-intra). 
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Furthermore, because the quantization matrices can 
change no more frequently than once each frame, this 
computation does not need to be performed very fre- 
quently. 

Although the previously described estimation pro- 
vides a useful prediction of the spatial distortion of 
resulting video output stream due to quantization error, 
further improvement is possible. Rate-distortion theory 
provides that the distortion is inversely related to the bit 
rate; that is, the more bits per unit time used in repre- 
senting the compressed video data, the less the distor- 
tion, and vice versa. Analogously, the more bits used in 
coding the DCT coefficients of a given block, the less 
the expected distortion, and vice versa. Based on this 
relationship, a refined MSE estimate uses a linear com- 
bination of the previous estimation (Eq. 3) and a signal 
that is the inverse of the number of bits that have been 
used in coding the DCT coefficients. The number of bits 
used in coding DCT coefficients is easily determined 
when parsing the bitstream. However, this number var- 
ies significantly from one block to another, and even 
from one macroblock to another. Accordingly, based on 
a regression analysis of empirical data, the refined MSE 
estimate uses a weighted sum of the MSE estimate 
given by Eq. (3). the inverse of the number of coded 
DCT bits per frame filtered by length-3 median fitter, and 
the inverse of the number of coded DCT bits per frame 
filtered by a length-4 median filter. The number of bits 
used to code the DCT coefficients of each frame are 
summed and the sums of the number of coded bits per 
frame are filtered using length-3 and length-4 median fil- 
ters. The median filters take the median sum of the last 
3 frames (length-3) and the last 4 frames (length-4), 
including the current frame, to smooth variations in the 
number of bits being used to code DCT coefficients. 
The median sums are then weighted and added to a 
weighted value of the MSE estimate. 

The refined MSE estimate is calculated by the 
CDQM 220 for each macroblock, and is provided to the 
GUI 200 for display by the bitstream quality analyzer 
190. The GUI 200 displays the refined MSE estimate on 
both a frame-by-frame basis, and at the macroblock 
level for the current frame. Prior to display at the mac- 
roblock level, the refined MSE estimate for each mac- 
roblock is smoothed by spatially averaging the refined 
MSE estimate for a given macroblock across its neigh- 
boring macroblocks, to better reflect the distortion that 
would appear to the user. 

The second estimate of the predicted amount of 
distortion of the resulting video output bitstream is 
based on the refined MSE estimate, and incorporates a 
simple model of human visual perception. This metric 
accounts for pattern sensitivity (i.e.. the sensitivity of the 
human eye as a function of spatial and temporal fre- 
quency) and multiplies the error spectrum by a contrast 
sensitivity function (CSF) of the human eye and com- 
putes its energy. The CSF measures the sensitivity of 
the visual system to stimuli as a function of their fre- 



quency The estimated amount of distortion based on 
quantization error, e(f x , f y , fj, can be determined for 
each macroblock of each frame based on the refined 
MSE estimate, where 1 X% f y are spatial frequency coordi- 
5 nates, and // is the temporal frequency. A sensitivity 
based distortion measure e 5 is given by Eq. (4): 



e s = J \\S(i x J y J t )\e(f x J r f t )\ 2 (4) 

where S(f x , f y represents the CSF of the human eye. 
and e(f x , f y frf 2 is similar to the refined MSE, but in the 
frequency domain. Because Eq. (4) is defined in the fre- 

15 quency domain, and the refined MSE is calculated in 
the pixel domain, the CDQM 220 instead implements 
Eq. (4) as a convolution in the pixel domain with a finite 
impulse response cfigrtal filter which approximates the 
CSF. The actual CSF that is used to determine this sec- 

20 ond estimate of distortion is based on the spatio-tempo- 
ral surface described in "Spatiotemporal Characteristics 
of Visual Mechanisms: Excitatory-Inhibitory Model", by 
Christina A. Burbek and D.H. Kelly, Journal of the Opti- 
cal Society of America . Vol. 70, No. 9. pp. 1121-1126, 

25 September 1980. The impulse response of the digital fil- 
ter is then obtained by windowing using Kaiser windows 
of length 16, according to well known signal processing 
techniques. Of course, it should be realized that other 
contrast sensitivity functions and different signal 

30 processing techniques may be used. Similar to the 
refined MSE estimate, this objective metric (i.e., the 
second estimate) is calculated by the CDQM 220 for 
each macroblock and is provided to the GUI 200 for dis- 
play by the bitstream quality analyzer 190. The objective 

35 metric, if selected by the user, is displayed on a frame- 
by-frame basis, and at the macroblock level for the cur- 
rent frame. 

The third estimate of the predicted amount of distor- 
tion of the resulting video output bitstream is also based 

40 on the refined MSE estimate and predicts the amount of 
distortion of the resulting video output bitstream due to 
motion prediction error (also termed "Displaced Frame 
Difference" (DFD)). The DFD is transmitted in a quan- 
tized form to the decoder for the non-intra blocks and 

45 represents the error of the motion prediction performed 
by the encoder (i.e.. what could not be predicted). 
Accordingly, H DFD[n-,, n£ represents the motion pre- 
diction error signal in the pixel domain prior to quantiza- 
tion (n 1( n 2 represent pixel coordinate axes, e.g. x, y), 

so and DFD c [n 1( nj represents the same signal (also in 
the pixel domain) after quantization (and IQDCT and 
IDCT), information describing DFDJn 1( n^ is contained 
in the bitstream in the form of DCT coefficients in the 
non-intra blocks. The difference between DFD[n-| , nj 

55 and DFD c [n 1 , nj is equal to the quantization error e[n^ . 
nj. which can be obtained from the refined MSE esti- 
mation. Accordingly, the distortion due to motion predic- 
tion error can be estimated (in the pixel domain) using 
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Eq. (5). 

DFD[n v n 2 ) = DFDc[n u n 2 ]+ e[n^n 2 ) (5) 

Because the DCT is a unitary transform, computa- 
tion of the energy of DFD[n 1t nj, (i.e., the amount of 
motion prediction error) can be easily carried out in the 
frequency domain by directly using the samples of the 
DCT coefficients of the non-intra blocks. Thus, an esti- 
mate of the amount of motion prediction error can be 
determined. macroHock-by-macroblock, based on the 
refined MSE. estimate and DCT coefficients of the non- 
intra blocks, without having to compute the IDCT This 
knowledge of the motion prediction error gives useful 
insight on the performance of motion estimation by the 
encoder. Similar to the refined MSE estimate and the 
second estimate (objective metric), the DFD is calcu- 
lated by the CDQM 220 for each macroblock and is pro- 
vided to the GUI 200 for display by the bitstream quality 
analyzer 190. The DFD estimate, if selected by the user, 
is displayed on a frame-by-frame basis, and at the mac- 
roblock level for the current frame. 

Although the previously descrfoed estimations of 
distortion were based on the approximations of Eq. (3), 
there are other methods of estimating the amount of 
spatial distortion of the resulting video output stream 
due to quantization error. Based on previous work by 
Gregory S. Yovanoff and Sam Liu (described in "Statis- 
tical Analysis of the DCT Coefficients and their Quanti- 
zation Error", Proceedings of the 30th Asilomar 
Conference on Signals. Systems, and Computers. 
1996) on the distribution of JPEG DCT coefficient distri- 
bution, it can be assumed that MPEG DCT coefficient 
distribution can also be estimated based on a statistical 
modeling of the DCT coefficient distribution and the dis- 
tribution of the quantization error. For example, assum- 
ing that we have an estimate of the DCT coefficient 
distribution by knowing the probability density function 
(pert), fj(x)), of each coefficient, where i is the index of 
the DCT coefficient, Yovanoff and Liu have shown that 
this pdf can be approximated by a generalized gaussian 
function (GGF). Denoting Qj as the quantized value for 
the considered DCT coefficient x, of index /, (i.e., after 
quantization at encoder and inverse quantization at 
decoder) the quantization error can be expressed as 
(since Qj is known from the bitstream): 



e= J (x-G,) 2 /,(x|Q,)dbr 



(6) 



where Aj is the quantization step for the considered 
coefficient (which is known from the bitstream) and x is 
the actual value of the original DCT coefficient corre- 
sponding to Qj, before quantization. The conditional 



10 



probability density function fj(x|Qj) in Eq. (6) is related to 
the previous pdf by Eq. (7): 



M*|G/) = 



fil*) 



(7) 



J fMdx 



Qi 



;'2 



where fj(x) is the distribution of DCT coefficients before 
quantization. The quantization error can thus be esti- 
mated, given an estimation of the pdf f s (x). A fixed esti- 

15 mate of the pdf fj(x) that is based on measurements of 
some original video sequences can be used, or alterna- 
tively, the pdf f j(x) can be estimated during the decoding 
process. The second method may potentially provide a 
better estimation of the pdf for the considered sequence 

20 and may be determined during the decoding process by 
measuring the histogram of the quantized DCT coeffi- 
cients and estimating the parameters of the GGF by 
maximum likelihood estimation. 

Fig. 8 illustrates the functional relationship of the 

25 CDQM 220 and the decoder 210 within the bitstream 
quality analyzer 190, and the varying levels of decoding 
that are required to predict the different estimates of dis- 
tortion of the resulting video output. As can be seen, the 
calculation of the refined MSE and the Objective metric 

30 only require IVLC decoding (unless user defined quan- 
tization matrices are present), while the DFD error esti- 
mation further requires IQDCT decoding. Given a 
typical MPEG video bitstream, none of the aforemen- 
tioned predictions of distortion error requires IDCT cai- 

35 culations. In addition, none of the predictions of 
distortion requires the original input video bitstream 
prior to encoding. 

It should be apparent to one of ordinary skill in the 
art that the methods used by the CDQM 220 to predict 

40 distortion and described above are valid for any block 
DCT compression scheme, including motion compen- 
sated block DCT compression schemes like MPEG-1 
and MPEG-2, and H.261. H.263, etc. Furthermore, it 
should also be apparent that a bitstream quality ana- 

45 tyzer capable of monitoring and analyzing these other 
types of video streams can be implemented. 

Fig. 9 illustrates a display screen, as seen by a 
user, of a bitstream analysis system similar to the one 
depicted in Fig. 6 when the bitstream quality analyzer 

so 190 is executing. At the top level, the GUI 200 presents 
the user with an Analysis Control Panel window 300 in 
which various information is monitored and displayed, 
various control functions may be selected, and in which 
varying degrees of monitoring and analysis may be 

55 selected. The Analysis Control Panel window 300 
includes a number of pull-down menus including a File 
menu 305, an Analysis menu 310 and a Help menu 315. 
The File menu 305 includes options allowing the user to 
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Open a particular file, for example, a file containing a 
video bitstream that has been written to memory 120, or 
to Exit the bitstream quality analyzer 1 90. Other options 
within the File menu 305 (not shown) may, of course, 
include deleting a file, renaming a file, etc. The Analysis 5 
menu 310 includes varying levels of analysis that may 
be selected by the user, including Macroblock Analysis, 
Buffer Analysis. CDQM Mean Square Error Analysis, 
and CDQM Displayed Frame Difference Analysis. 
Options within the Help menu 315 are not shown, but 10 
would of course, include information describing the var- 
ious options and the operation of the bitstream quality 
analyzer 1 90. 

Within the Analysis Control Panel window 300, 
users can select what information is displayed at the is 
frame level or the group of pictures level by accessing 
the control function buttons 320. By default, the top level 
of the Analysis Control Panel window 300 includes vari- 
ous fields describing the monitored video bitstream 
including a General Information field 325, a Sequence 20 
Header field 330, a GOP Header field 335, a Picture 
Header field 340, and a Slice Header field 345. The 
information displayed within the General Information 
field 325, the Sequence Header field 330, the GOP 
Header field 335. and the Picture Header field 340 is 25 
based on information parsed from the appropriate 
headers and needs no further explanation. As shown in 
Fig. 9, the Slice Header field 345 includes minimum and 
maximum quantization scale values for all the slices in 
the particular frame being monitored, as well as the total 30 
number of slices within the particular frame. Were the 
frame to include only one slice, then the minimum and 
maximum quantization scale values would be the same. 
Also depicted within the Slice Header field 345 are the 
quantization scale values, by slice, for each of the slices 35 
in the particular frame. Additional quantization scale val- 
ues, beyond the six slices shown, can be viewed by 
scrolling downward. It should be emphasized that all the 
information displayed in the top level of the Analysis 
Control Panel window 300 and described thus far is 40 
obtained by parsing the compressed video bitstream 
and without any decoding. Accordingly, this information 
requires negligible time to display. 

The Analysis Control Panel window 300 also pro- 
vides the user with the ability to display the motion pic- 45 
ture represented by the video bitstream by selecting 
Display Video 350. However, it will of course be realized 
that the display of the video requires full decoding of the 
compressed video bitstream. An example of such a 
selection is shown in Fig. 10, in which a viewing area so 
355 is presented to the user via a Video Title window 
360. Although not shown, the motion picture is dis- 
played in the viewing area 355 and is controlled by the 
control function buttons 320 (Fig. 9). 

Figs. 11 and 12 illustrate Sequence Extension and ss 
Picture Coding Extension drop down menu windows 
365, 370 that are viewable from within the Analysis Con- 
trol Panel window 300 by selecting Extension Informa- 



tion within the Sequence Header field 330, or Picture 
Coding Extension within the Picture Header field 340, 
respectively. 

Fig. 13 illustrates a Bit/Buffer Analysis window 375 
of the video bitstream when the user selects Buffer 
Analysis within the Analysis pull-down menu 310. As 
shown in the Buffer Analysis portion 380 of the 
Brt/Buffer Analysis window 375, the GUI 200 displays, 
on a frame-by-frame basis, curves representing Buffer 
Size 385 (based on vbv_buffer_size and 
vbv_buffer_size_extension), Std. Decoder Buffer Status 
390 (based on the STD Decoder model), an estimate of 
the vbv delay (VBV Delay Estimate 395), and an Esti- 
mated Buffer Occupancy Difference 400, all relating to 
the state of the compressed video data buffer of the 
decoder 210. The Estimated Buffer Occupancy Differ- 
ence 400 represents the difference between Std. 
Decoder Buffer Status 390 and VBV Delay Estimate 
395. As can be seen from Fig. 1 3, the information that is 
displayed allows one to quickly view the fullness of the 
compressed video buffer using two different metrics, to 
identify risks of buffer under flow or overflow, to assess 
the efficiency of buffer utilization by the encoded video 
bitstream, and to assess the real-time decodability of 
the encoded video bitstream. 

The Bit Analysis portion 405 of Fig. 13 displays two 
different metrics regarding bit usage. The lower left por- 
tion 410 of the Bit Analysis portion 405 displays the 
number of bits and the frame type of the most recent 
frames, with the darkened areas representing the 
number of stuffed bits in each of the most recent frames. 
The lower right portion 415 of the Bit Analysis portion 
405 displays the target bit rate 420 (based on sequence 
header and sequence extension information) as com- 
pared to a smoothed average value of the actual bit rate 
425. 

Fig. 14 illustrates a Macroblock Analysis window 
430 for the video bitstream when the user selects Mac- 
roblock Analysis within the Analysis pull-down menu 
310. A wealth of information is available to the user 
depending on the Display Options 435 selected by the 
user. Region 440 of the Macroblock Analysis window 
430 shows various types of macroblocks and the per- 
centage of macroblocks of that type within the current 
frame. Viewing area 440 corresponds to the viewing 
area 355 in Fig. 10 in which the video is displayed. 

Fig. 15 illustrates a Display Options pop-up window 
455 that is displayed to the user when the Display 
Options button 435 is selected in the Macroblock Analy- 
sis window 430 shown in Fig. 14. The various display 
options that may be selected by the user include dis- 
playing Macroblock Type, macroblock quantization 
value MQUANT, macroblock Motion Type, and macrob- 
lock Motion Vector(s). Similarly, the user may select the 
granularity of bit allocation analysis, which, as depicted 
in the Bits per Macroblock legend 450 of the Macroblock 
Analysis window 430 in Frg. 14, controls granularity of 
the shading in the Macroblock Analysis window 430. 
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The actual information displayed in Fig. 14 corresponds 
to a selection of Macroblock Type only. Depending upon 
the display options selected by the user, the level of 
decoding that is performed by the bitstream quality ana- 
lyzer 190 and displayed by the GUI 200 will vary from 
none to decoding through the IVLC level. 

Fig. 1 6 shows an Additional Macroblock Information 
pop up window 460 that is displayed to the us©- when 
the Additional MB Info button 445 is selected in the Mac- 
roblock Analysis window 430 shown in Fig. 14. The 
information contained in this window is self explanatory 
and requires IVLC decoding and IQDCT decoding. 

Fig. 17 shows a Spatial Distortion Estimation Anal- 
ysis window 465 when the user selects CDQM MSE 
Analysis within the Analysis pull-down menu 310. As 
illustrated, the left hand portion 470 displays the refined 
MSE 480 estimation of spatial distortion and the Objec- 
tive Measure 485 estimation of spatial distortion, on a 
frame-by-frame basis, for the luminance components, 
with a Current Frame marker 490 depicting the current 
frame. The right hand portion 475 illustrates spatial dis- 
tortion of the current frame (as indicated by the Current 
Frame marker 490) at a macroblock level of detail using 
the refined MSE measure of spatial distortion, although 
the Objective Measure could have been selected 
instead via the Display Options button 495. As men- 
tioned previously, prior to display at the macroblock 
level, the refined MSE estimate for each macroblock is 
smoothed by spatially averaging the refined MSE esti- 
mate for a given macroblock across its neighboring 
macroblocks, to better reflect the distortion that would 
appear to the user. 

Fig. 18 illustrates a Spatial Distortion Display 
Options window 500 when the Display Options button 
495 is selected in the Spatial Distortion Estimation Anal- 
ysis window 465. The Spatial Distortion Display Options 
window 500 allows the user customize the vairious spa- 
tial distortion metrics to their particular requirements. 

Fig. 19 illustrates a Displaced Frame Difference 
Predicted Error Analysis window 505 when the user 
selects CDQM DFD Analysis within the Analysis pull- 
down menu 310. As illustrated, the left hand portion 510 
displays the displaced frame difference estimation of 
motion prediction Distortion 520 on a frame-by-frame 
basis, with a Current Frame marker 525 depicting the 
current frame. The right hand portion 515 illustrates the 
estimated motion prediction Distortion of the current 
frame (as indicated by the Current Frame marker 525) 
at a macroblock level of detail. The various display 
options that are selectable by the user are illustrated in 
Fig. 20 in a DFD Predicted Error Display Options win- 
dow 535 by selecting the Display Option button 530 in 
the Displaced Frame Difference Predicted Error Analy- 
sis window 505 of Fig. 19. 

Having thus described at least one illustrative 
embodiment of the invention, various alterations, modi- 
fications, and improvements will readily occur to those 
skilled in the art Such alterations, modifications, and 



improvements are intended to be within the spirit and 
scope of the invention. Accordingly, the foregoing 
description is by way of example only and is not 
intended as limiting. The invention is limited only as 
5 defined in the following claims and the equivalents 
thereto. 

Claims 

io 1. A bitstream quality analysis system comprising: 



a demultiplexer (180) to receive a bitstream 
and separate from the bitstream at least one 
elementary bitstream, the at least one elemen- 
tary bitstream including a video elementary bit- 
stream; 

a bitstream quality analyzer (190), operatively 
coupled to the demultiplexer, to receive the 
video elementary bitstream from the demulti- 
plexer and parse the video elementary bit- 
stream, the bitstream quality analyzer 
extracting parsed parameters characterizing 
the video elementary bitstream; and 
a graphical user interface (200). operatively 
coupled to the bitstream analyzer, to receive 
the parsed parameters from the bitstream qual- 
ity analyzer and display the parsed parameters. 
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2. The bitstream quality analysis system of claim 1 , 
wherein a level of analysis performed on the video 
elementary bitstream is selected by a user, and 
wherein the bitstream quality analyzer includes: 

an MPEG video decoder (210) that is opera- 
tively coupled to the graphical user interface, 
the video decoder receiving the video elemen- 
tary bitstream and selectively decoding the 
video elementary bitstream in layers, to a layer 
of decoding that corresponds to the selected 
level of analysis, the video decoder providing 
decoded parameters to the graphical user 
interface corresponding to the layer ol decod- 
ing performed by the video decoder. 

The bitstream quality analysis system of claim 2, 
wherein the bitstream quality analyzer further 
includes: 

a compressed domain quality meter (220) that 
is operatively coupled to the video decoder and 
the graphical user interface, the compressed 
domain quality meter receiving the decoded 
parameters corresponding to the layer of 
decoding performed by the video decoder and 
calculating at least one metric of an amount of 
distortion in a fully decoded video scene that is 
contained in the video elementary bitstream 
based on the decoded parameters and without 
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full decoding by the video decoder, the com- 
pressed domain quality meter providing the at 
least one metric to the graphical user interface 
for display. 

A method of monitoring a brtstream comprising the 
steps of: 



mentary video brtstream due to the quantiza- 
tion error and capable of human visual 
perception; and 

displaying the distortion metric. 

The method of claim 4, further corrprising the steps 
of: 



separating from the brtstream at least one ele- 
mentary bitstream, the at least one elementary 10 
brtstream including a video elementary bit- 
stream; 

parsing the video elementary brtstream to 
extract parameters characterizing the video 
elementary bitstream; and is 
displaying the extracted parameters on a dis- 
play. 



The method of claim 4, further comprising the steps 
of: 



The method of claim 4, further comprising the steps 
of: 



20 



variable length decoding the video elementary 
bitstream to provide decoded data that is varia- 
ble length encoded in the video elementary bit- 
stream; 25 
determining a quantization scale of a macrob- 
lock contained in the elementary video bit- 
stream based on one of the extracted 
parameters and the decoded data; 
calculating a first distortion metric indicative of 30 
an amount of distortion in the elementary video 
bitstream due to quantization error based upon 
the quantization scale of the macroblock and 
quantization values of DOT coefficients within 8. 
the macroblock; and 35 
displaying the first distortion metric on the dis- 
play. 



40 



variable length decoding the video elementary 
brtstream to provide decoded data that is varia- 
ble length encoded in the video elementary bit- 
stream; 45 
determining a quantization scale of a macrob- 
lock contained in the elementary video bit- 
stream based on one of the extracted 
parameters and the decoded data; 
calculating an amount of distortion in the ele- so 
mentary video bitstream due to quantization 9. 
error based upon the quantization scale of the 
macroblock and quantization values of DCT 
coefficients within the macroblock; 
multiplying the amount of distortion by a con- ss 
trast sensitivity function of human visual per- 
ception to calculate a distortion metric 
indicative of the amount of distortion in the ele- 



variable length decoding the video elementary 
bitstream to provide decoded data that is varia- 
ble length encoded in the video elementary bit- 
stream; 

determining a quantization scale of a predicted 
macroblock contained in the elementary video 
bitstream based on one of the extracted param- 
eters and the decoded data; 
calculating an amount of distortion in the ele- 
mentary video brtstream due to quantization 
error based upon the quantization scale of the 
predicted macroblock and quantization values 
of DCT coefficients within the predicted mac- 
roblock; 

inversely quantizing DCT coefficients within the 
predicted macroblock to provide dequantized 
DCT coefficients; 

calculating an amount of distortion in the ele- 
mentary video bitstream due to motion predic- 
tion error based on the amount of distortion 
and the dequantized DCT coefficients; and 
displaying the amount of distortion in the ele- 
mentary video bitstream due to motion predic- 
tion error on the cfi splay. 

A computer controlled method of predicting distor- 
tion in a video sequence that is compressed by a 
block DCT compression scheme, the method com- 
prising the steps of: 

receiving an encoded video bitstream contain- 
ing a compressed video sequence correspond- 
ing to the video sequence; 
variable length decoding the encoded video bit- 
stream to extract the compressed video 
sequence; and 

calculating an amount of distortion in the video 
sequence due to quantization of DCT coeffi- 
cients based on the extracted compressed 
video sequence and without decompressing 
the compressed video sequence. 

The computer controlled method of claim 8, further 
comprising the steps of:. 

dequantizirtg DCT coefficients contained within 
the compressed video sequence; and 
calculating an amount of distortion in the video 
sequence due to motion prediction error based 
on the dequantized DCT coefficients and the 
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amount of cfi stortion in the video sequence due 
to the quantization of OCT coefficients. 

1 0. The compute - controlled method of claim 8, further 
comprising steps of: s 

calculating a probability density function of 
DCT coefficient distribution for each DCT coef- 
ficient of the DCT coefficients; and 
calculating a conditional probability density io 
function of DCT coefficient quantization error 
based on the probability density function of 
DCT coefficient distribution; 

wherein the step of calculating the 
amount of distortion in the video sequence due 15 
to quantization of DCT coefficients calculates 
the amount of distortion based on the extracted 
compressed video sequence and the condi- 
tional probability density function and without 
decompressing the compressed video 20 
sequence. 
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